Hardware-based performance monitoring with VTune Performance Analyzer under Linux
نویسنده
چکیده
All new modern processors have hardware support for monitoring processor performance. In this project, we try to explore use of VTune Performance Analyzer for hardware-based performance monitoring of a Linux cluster of Pentium 4 Xeon processors.
منابع مشابه
Demystifying Intel Branch Predictors
Improvement of branch predictors has been one of the focal points of computer architecture research during the last decade, ranging from two-level predictors to complex hybrid mechanisms. Most research efforts try to use real, already implemented, branch predictor sizes and organizations for comparison and evaluation. Yet, little is known about exact predictor implementation in Intel processors...
متن کاملMonitoring Linux with Native Tools
Linux is gaining interest as a solution across many hardware platforms: Intel based machines, Sun and Apple proprietary hardware and IBM zSeries platforms. But once applications are ported to an open source operating system what options are available to monitor their performance and availability? This presentation covers native Linux solutions to monitoring performance and collecting statistics...
متن کاملThe perfmon2 interface specification
Performance Monitoring Unit, PMU, performance tools, hardware counters, IPF, IA64 Linux, perfmon kernel interface Monitoring program execution is becoming key to achieving world class performance. All modern processors implement a sophisticated set of hardware performance counters to collect a lot of micro-architectural events which are important clues for software optimizations. Yet there is n...
متن کاملComparison and Analysis of Parallel Computing Performance Using OpenMP and MPI
The developments of multi-core technology have induced big challenges to software structures. To take full advantages of the performance enhancements offered by new multi-core hardware, software programming models have made a great shift from sequential programming to parallel programming. OpenMP (Open Multi-Processing) and MPI (Message Passing Interface), as the most common parallel programmin...
متن کاملPlatform Performance Comparison of PALM Network on Pentium 4 and FPGA
When simulating very large, biologically plausible models on desktop computers, the memory bandwidth is the biggest bottleneck due to the significant performance difference between memory and processor. We did the performance analysis for different variations of the Palm association network implemented on Pentium 4 with VTune 6.1 Performance Analyzer. We also analyzed the performance of an FPGA...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003